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ABSTRACT 

We have analysed the angular clustering of X-ray selected active galactic nuclei (AGN) in different 
flux-limited sub-samples of the Chandra Deep Field North (CDF-N) and South (CDF-S) surveys. We 
find a strong dependence of the clustering strength on the sub-sample flux-limit, a fact which explains 
most of the disparate clustering results of different XMM and Chandra surveys. Using Limber's equation, 
we find that the inverted CDF-N and CDF-S spatial clustering lengths are consistent with direct spatial 
clustering measures found in the literature, while at higher flux-limits the clustering length increases 
considerably; for example, at f x ,\ixmt ~ 10~ 15 erg s _1 cm~ 2 we obtain r ~ 17 ± 5 and 18 ± 3 h- 1 Mpc, 
for the CDF-N and CDF-S, respectively. We show that the observed flux- limit clustering trend hints 
towards an X-ray luminosity dependent clustering of X-ray selected, z ~ 1, AGNs. 

Subject headings: galaxies: active — quasars: general — surveys — cosmology: observations — 
large-scale structure of the universe 



1. INTRODUCTION 

X-ray selected AGNs provide a relatively unbiased cen- 
sus of the AGN phenomenon, since obscured AGNs, 
largely missed in optical surveys, are included in such sur- 
veys. Furthermore, they can be detected out to high red- 
shifts and thus trace the distant density fluctuations pro- 
viding important constraints on supermassive black hole 
formation, the relation between AGN activity and Dark 
Matter (DM) halo hosts, the cosmic evolution of the AGN 
phenomenon (eg. Mo & White 1996, Sheth et al. 2001), 
and on cosmological parameters and the dark-energy equa- 
tion of state (eg. Basilakos & Plionis 2005; 2006, Plionis 
& Basilakos 2007). 

Until quite recently our knowledge of X-ray AGN clus- 
tering came exclusively from analyses of ROSAT data 
(< 3keV) (eg. Boyle & Mo 1993; Vikhlinin & Forman 
1995; Carrera et al. 1998; Akylas, Georgantopoulos, Plio- 
nis, 2000; Mullis et al. 2004). These analyses provided 
conflicting results on the nature of high-z AGN cluster- 
ing. Vikhlinin & Forman (1995), using the angular cor- 
relation approach and inverting to infer the spatial corre- 
lation length, found a strong amplitude of z ~ 1 sources 
(ro — 9 h^ 1 Mpc), which translates into tq ~ 12 h^ 1 Mpc 
for a ACDM cosmology and a luminosity driven density 
evolution (LDDE) luminosity function (eg. Hasinger et al. 
2005). Carrera et al. (1998), however, using spectroscopic 
data, could not confirm such a large correlation amplitude. 

With the advent of the XMM and Chandra X-ray obser- 
vatories, many groups have attempted to settle this issue. 
Recent determinations of the high-z X-ray selected AGN 
clustering, in the soft and hard bands, have provided again 
a multitude of conflicting results, intensifying the debate 
(eg. Yang et al. 2003; Manners et al. 2003; Basilakos et 
al. 2004; Gilli et al. 2005; Basilakos et al 2005; Yang et 
al. 2006; Puccetti et al. 2006; Miyaji et al. 2007; Gandhi 
et al. 2006; Carrera et al. 2007). 



In this letter we investigate these clustering differences 
by re-analysing the CDF-N and CDF-S surveys, using the 
Bauer et al. (2004) classification to select only AGNs in 
the 0.5-2 and 2-8 keV bands. To use all the available 
sources, and not only those having spectroscopic redshifts, 
we work in angular space and then invert the angular cor- 
relation function using Limber's equation. Hereafter, we 
will be using h = H o /100 km s^ 1 Mpc -1 . 



2. THE X-RAY SOURCE CATALOGUES 

The 2Ms CDF-N and lMs CDF-S Chandra data rep- 
resent the deepest observations currently available at X- 
ray wavelengths (Alexander et al. 2003, Giaconni et al. 
2001). The CDF-N and CDF-S cover an area of 448 and 
391 arcmin 2 , respectively. We use the source catalogues of 
Alexander et al. (2003) for both CDF-N and CDF-S. The 
flux limits that we use for the CDF-N are 3 x 10~ 17 and 
2 x 10~ 16 erg cm~ 2 s _1 in the soft and hard band, while for 
the CDF-S the respective values are 6x 10 -17 and 5x 10~ 16 
erg cm~ 2 s . Note that sensitivity maps were produced 
following the prescription of Lehmer et al. (2005) and in 
order to produce random catalogues in a consistent man- 
ner to the source selection, we discard sources which lie 
below our newly determined sensitivity map threshold, at 
their given position. Our final CDF-N catalogues con- 
tain 383 and 263 sources in the soft (0.5-2 keV) and hard 
band (2-8 keV) , respectively, out of which 304 and 255 are 
AGNs, according to the "pessimistic" Bauer et al. (2004) 
classification. The corresponding CDF-S catalogues con- 
tain 257 and 168 sources in the same bands, out of which 
227 and 165 are AGNs. A number of sources (roughly half) 
have spectroscopic redshift determinations (mostly taken 
from Barger et al. 2003; Szokoly et al. 2004; Vanzella 
et al. 2005; Vanzella et al. 2006; Le Fevre et al. 2004; 
Mignoli et al. 2005). 
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3. CORRELATION FUNCTION ANALYSIS 

3.1. The angular correlation 

The clustering properties of the X-ray AGNs are esti- 
mated using the two-point angular correlation function, 
w(6), estimated using w(6) — /(Ndd/Ndr) — 1, where 
Ndd and Ndr is the number of data-data and data- 
random pairs, respectively, within separations 9 and 9+d9. 
The normalization factor is given by / = 2Nn/(Nrj — 1), 
where Nd and Nr are the total number of data and ran- 
dom points respectively. The Poisson uncertainty in w{9) 
is estimated v / (l+w(9))/N DR (Peebles 1973). 

The random catalogues are produced to account for the 
different positional sensitivity and edge effects of the sur- 
veys. To this end we generated 1000 Monte Carlo random 
realizations of the source distribution, within the CDF-N 
and CDF-S survey areas, by taking into account the lo- 
cal variations in sensitivity. We also reproduce the desired 
log N — log S distribution, either the Kim et al. (2007) or 
the one recovered directly from the CDF data (however 
our results remain mostly unchanged using either of the 
two). Random positioned sources with fluxes lower than 
that corresponding to the particular position of the sensi- 
tivity map are removed from our final random catalogue. 

We apply the correlation analysis evaluating w(9) in the 
range [5 ,900 ] in 10 logarithmic intervals with 5\og9 = 
0.226. We find statistically significant signals for all bands 
and for both CDF-N and CDF-S. As an example we 
provide in Table 1 the integrated signal to noise ratios, 
given for two different flux-limits and two different angu- 
lar ranges. The significance appears to be low only for the 
CDF-S hard-band, but for the lowest flux-limit. 

The angular correlation function for two different flux- 
limits are shown in Figure 1, with the lines corresponding 
to the best-fit power law model: w{&) = (9q/9) 1 ~ 1 , us- 
ing 7 = 1.8 and the standard y 2 minimization procedure. 
Note that for the CDF-N, we get at some 0's very low or 
negative w{9) values. These, however, are taken into ac- 
count in deriving the integrated signal, presented in Table 
1. 

Applying our analysis for different flux-limited sub- 
samples, we find that the clustering strength increases with 
increasing flux-limit, in agreement with the CDF-S results 
of Giacconi et al (2001). In Figure 2 we plot the angu- 
lar clustering scale, 9q, derived from the power-law fit of 
w(9), as a function of different sample flux- limits. The 
trend is true for both energy bands and for both CDF-N 
and CDF-S, although for the latter is apparently stronger. 

We also find that at their lowest respective flux-limits 
the clustering of CDF-S sources is stronger than that of 
CDF-N (more so for the soft-band), in agreement with 
the spatial clustering analysis of Gilli et al. (2005). This 
difference has been attributed to cosmic variance, in the 
sense that there are a few large superclusters present in 
the CDF-S (Gilli et al. 2003). However, selecting CDF- 
N and CDF-S sources at the same flux-limit reduces this 
difference, which remains strong only for the highest flux- 
limited sub-samples (see Fig. 2). 

It is worth mentioning that our results could in princi- 
ple suffer from the so-called amplification bias, which can 
enhance artificially the clustering signal due to the detec- 
tor's PSF smoothing of source pairs with intrinsically small 



angular separations (see Vikhlinin & Forman 1995; Basi- 
lakos et al. 2005). However, we doubt whether this bias 
can significantly affect our results because at the median 
rcdshift of the sources (z ~ 1) the Chandra PSF angular 
size of ~ 1 corresponds to a rest-frame spatial scale of 
only ~ 5 h^ 1 kpc (even at large off-axis angles, where the 
PSF size increases to ~ 4 , the corresponding spatial scale 
is only ~ 20 h~ x kpc). In any case, and ignoring for the 
moment the additional effect of the variable PSF size, the 
above imply that only the w(9) of the lowest flux-limited 
samples could in principle be affected, but in the direction 
of reducing (and not inducing) the observed 9q — /x, limit 
trend (since the uncorrected 9q values are, if anything, ar- 
tificially larger than the true underlying one). 

However, the variability of the PSF size through-out the 
Chandra field can have an additional effect, and possibly 
enhance or even produce the observed 9q — /x, limit trend. 
To test for this we have repeated our analysis, restricting 
the data to a circular area of radius 6 around the center 
of the Chandra fields, where we expect to have a relatively 
small variation of the PSF size. This choice of radius was 
dictated as a compromise between excluding as much ex- 
ternal area as possible but keeping enough sources (<J 50% 
of original) to perform the clustering analysis. The results 
show that indeed the 9q — /x, limit trend is present and qual- 
itatively the same as when using all the sources, implying 
that the previously mentioned biases do not create the ob- 
served trend. 

3.2. Comparison with other w(9) results 

We investigate here whether the large span of published 
X-ray AGN clustering results can be explained by the de- 
rived 9 — /x, limit trend. To this end we attempt to take 
into account the different survey area-curves, by estimat- 
ing a characteristic flux for each survey, f x (^AC), corre- 
sponding to half its area-curve (easy to estimate from the 
different survey published area-curves). 

In Figure 3 we plot the corresponding values of 9$ (for 
fixed 7 = 1.8) as a function of f x (^AC) (for both hard 
and soft bands) for the Chandra Large AREA Synoptic X- 
ray survey (CLASXS) (Yang et al. 2003), the XMM/2dF 
(Basilakos et al. 2004; 2005), XMM- COSMOS (Miyaji et 
al. 2007), XMM-ELAIS-S1 (Puccetti et al. 2006), XMM- 
LSS (Gandhi et al. 2006) and AXIS (Carrcra et al. 2007) 
surveys. With the exception of the Yang et al. (2003) 
and the Carrera et al. (2007) hard-band results, the rest 
are consistent with the general flux-dependent trend. Note 
also that Gandhi et al. (2006) do not find any significant 
clustering of their hard-band sources. Of course, cosmic 
variance is also at work (as evidenced also by the cluster- 
ing differences between the CDF-N and CDF-S; see Fig. 2 
and Gilli et al. 2005) which should be responsible for the 
observed scatter around the main trend (see also Stewart 
et al. 2007). 

We would like to stress that the CDF surveys have a 
large flux dynamical range which is necessary in order to 
investigate the / x , limit — do correlation. This is probably 
why this effect has not been clearly detected in other sur- 
veys, although recently, a weak such effect was found also 
in the CLASXS survey (Yang et al. 2006). 

3.3. The spatial correlation length using w(9) 
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We can use Limber's equation to invert the angular 
clustering and derive the corresponding spatial clustering 
length, r (eg. Peebles 1993). To do so it is necessary to 
model the spatial correlation function as a power law and 
to assume a clustering evolution model, which we take to 
be that of constant clustering in comoving coordinates (eg. 
de Zotti et al. 1990; Kundic 1997). For the inversion to 
be possible it is necessary to know the X-ray source red- 
shift distribution, which can be determined by integrating 
the corresponding X-ray source luminosity function above 
the minimum luminosity that corresponds to the partic- 
ular flux-limit used. To this end we use the Hasinger et 
al. (2005) and La Franca et al. (2005) LDDE luminosity 
functions for the soft and hard bands, respectively. 

We perform the above inversion in the framework of the 
concordance ACDM cosmological model (f2 m = 1 — J1a = 
0.3) and the comoving clustering paradigm. The resulting 
values of the spatial clustering lengths, ro, show the same 
dependence on flux-limits, as in Fig. 2. 

We can compare our results with direct determinations 
of the spatial-correlation function from Gilli et al. (2005), 
who used a smaller (~ 50%) spectroscopic sample from 
the CDF-N and CDF-S. They found a significant differ- 
ence between the CDF-S and CDF-N clustering, with 
r = 10.3 ± 1.7 h- 1 Mpc and r = 5.5 ± 0.6 h' 1 Mpc, 
respectively (note also that the corresponding slopes were 
quite shallow, roughly 7 ~ 1.4 — 1.5). Since, Gilli et al. 
used sources from the full (0.5-8 keV) band, we compare 
their results with our soft-band results which, dominate 
the total-band sources. This comparison is possible, be- 
cause as we have verified using a Kolmogorov-Smirnov 
test, the flux distributions of the sub-samples that have 
spectroscopic data are statistically equivalent with those 
of the whole samples. Our inverted clustering lengths, for 
the lowest flux-limit used, are: ro = 10.3 ±2 h^ 1 Mpc and 
r = 6.4 ± 2.5 h' 1 Mpc (fixing 7 = 1.8) for the CDF-S 
and CDF-N respectively, in good agreement with the Gilli 
et al. (2005) direct 3D determination^. 

Our values can be also compared with the re-calculation 
of the CDF-N spatial clustering by Yang et al. (2006), who 
find r ~ 4.1 ± 1.1 /i" 1 Mpc. 

We return now to the strong trend between 0$ (or the 
corresponding ro) and the sample flux-limit (see Fig. 2 and 
3), which could be due to two possible effects (or the com- 
bination of both). Either the different flux- limits corre- 
spond to different intrinsic luminosities, ie., a luminosity- 
clustering dependence (see also hints in the CLASXS and 
CDF-N based Yang et al. 2006 results; while for optical 
data see Porciani & Norberg 2006) or a redshift-dependent 
effect (ie., different flux- limits correspond to different red- 
shifts traced). Using the sources which have spectroscopic 
rcdshift determinations we have derived their intrinsic lu- 
minosities, in each respective band, from their count rates 
using a spectral index Y = 1.9 and the concordance ACDM 
cosmology. We have also applied an absorption correction 
by assuming a power-law X-ray spectrum with an intrin- 
sic r = 1.9, obscured by an optimum column density to 



reproduce the observed hardness ratio. We then derive, 
for each flux-limit used, the median redshift and median 
luminosity of the corresponding sub-sample. We find rel- 
atively small variations and no monotonic change of the 
median redshift with subsample flux-limit. For example, 
the median spectroscopic redshift for the soft and hard 
bands, at the lowest flux-limit used, is z ~ 0.8 and ~ 0.95, 
respectively, while its mean variation between the differ- 
ent flux-limits used is (Sz/z) ~ —0.11 and -0.03 for the 
CDF-N and (Sz/z) ~ -0.27 and 0.05 for the CDF-S soft 
and hard-bands, respectively. The large redshift variation 
of the soft-band CDF-S data should be attributed to the 
presence of a few superclusters at z ~ 0.7; see Gilli et al. 

2003) . 

In Figure 4 we present the correlation between the sub- 
sample median X-ray luminosity and the corresponding 
subsample clustering length, as provided by Limber's in- 
version. Although the CDF luminosity dynamical range is 
limited, it is evident that the median X-ray luminosity sys- 
tematically increases with increasing sample flux-limit and 
it is correlated to ro (as expected from Fig. 2). It should be 
noted that the correlation length of the highest-flux lim- 
ited CDF-S soft-band subsample is by far the largest ever 
found (~ 30/i _1 Mpc), but one has to keep in mind that 
the CDF-S appears not to be a typical field, as discussed 
earlier (see Gilli et al. 2003). The CDF-N high-flux results 
appear to converge to a value of ro ~ 18ft- -1 Mpc, similar 
to that of some other surveys (eg. Basilakos et al. 2004; 
2005 and Puccetti et al. 2007). 

We therefore conclude that not only are there indica- 
tions for a luminosity dependent clustering of X-ray se- 
lected high-z AGNs, but also that they are significantly 
more clustered than their lower- z counterparts, which have 
r ~ 7 - 8 h' 1 Mpc (eg. Akylas et al. 2000; Mullis et al. 

2004) . This is a clear indication of a strong bias evolution 
(eg. Basilakos, Plionis & Ragone-Figueroa 2007). 

4. CONCLUSIONS 

We have analysed the angular clustering of the CDF-N 
and CDF-S X-ray AGNs and find: 

(1) A dependence of the angular clustering strength on 
the sample flux-limit. Most XMM and Chandra cluster- 
ing analyses provide results that are consistent with the 
observed trend; a fact which appears to lift the confusion 
that arose from the apparent differences in their respective 
clustering lengths. 

(2) Within the concordance cosmological model, the co- 
moving clustering evolution model and the LDDE lumi- 
nosity function, our angular clustering results are in good 
agreement with direct estimations of the CDF-N and CDF- 
S spatial clustering, which are based however on roughly 
half the total number of sources, for which spectroscopic 
data were available. 

(3) The apparent correlation between clustering strength 
and sample flux-limit transforms into a correlation be- 
tween clustering strength and intrinsic X-ray luminosity, 
since no significant redshift-dependent trend was found. 



Leaving both ro and 7 as free parameters in the fit, we obtain 7's quite near their nominal value (7 ~ 1.6 — 1.8) 
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Table 1 

The integrated angular clustering signal. 



Sample limit" fx, limit'' 





< 400 


< 900 


< 400 


< 900 


CDF-N soft 


2.1a 


O.Ocr 


2.1a 


1.5a 


CDF-N hard 


3.3a 


2.6a 


6.7a 


5.5a 


CDF-S soft 


4.2er 


2.6a 


3Aa 


3.7a 


CDF-S hard 


0.3a 


1.3a 


4.1(7 


2.7a 



a the lowest flux- limit. 

b 5 x 10~ 16 erg s _1 cm~ 2 (soft band) and 10~ 15 erg s _1 cm~ 2 (hard band). 
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Fig. 1. — The CDF-S and CDF-N angular correlation function. The open points correspond to the overall sample, while the filled points 
to the highest flux-limit used (f x = 3 X 10 -15 erg/s/cm 2 for the soft and f x = 5 X 10 -15 erg/s/cm 2 for the hard bands, respectively). The 
straight lines correspond to the best power-law fit to the clustering data. Errorbars correspond to la Poisson uncertainties. 
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Fig. 2. — The angular clustering scale as a function of the flux-limit of the different samples. The dependence of clustering strength to the 
flux-limit is evident. The left and right panels correspond to the hard and soft bands respectively. Filled symbols correspond to the CDF-N 
while open ones to the CDF-S. 
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Fig. 3. — The angular correlation scale, 8o, as a function of different survey characteristic flux, defined as that corresponding to half the 
respective survey area-curves. Most results appear to be consistent with the clustering flux-limit dependence, found from the CDF-N and 
CDF-S. Note that in the left panel we plot the 4.5-fO keV results of Miyaji et al. (2007). Errorbars correspond to lcr Poisson uncertainties. 
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Fig. 4. — The correlation between the clustering length, ro, and the median intrinsic X-ray luminosity of each subsample. The left and 
right panels correspond to the hard and soft bands respectively. Filled symbols correspond to the CDF-N while open ones to the CDF-S. 



